A variable rate multimodal speech coder with gain-matched analysis-by-synthesis
Authors
Abstract
In general, a variable-rate coder can achieve the same speech quality as a fixed-rate coder while reducing the average bit rate. We have developed a variable-rate multimodal speech coder with an average bit rate of 3 kb/s for a speech activity factor of 80% and quality comparable to the GSM full-rate coder. The coder has four coding modes and uses a robust classification method involving the pitch gain, zero crossings, and a peakiness measure. The coder also employs a novel gain-matched analysis-by-synthesis technique for very low rate coding of unvoiced frames and an improved noise-level-dependent postfilter. This paper describes the details of our algorithm and presents the results from subjective listening tests.
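The abstract names three frame features used for mode classification but does not define them. As a minimal sketch only, assuming common textbook definitions (zero crossings as sign changes, peakiness as the ratio of RMS to mean absolute value, pitch gain as the optimal one-tap predictor gain at a given lag), they could be computed per frame as follows. The function names, the definitions, and the `lag` parameter are assumptions for illustration and are not taken from the paper; the paper's decision thresholds and mode logic are not reproduced here.

```python
import math


def zero_crossings(frame):
    """Count sign changes between consecutive samples (assumed definition)."""
    return sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))


def peakiness(frame):
    """RMS divided by mean absolute value (assumed definition).

    Equals 1.0 for a constant-magnitude frame and grows when the energy
    is concentrated in a few samples.
    """
    n = len(frame)
    mean_abs = sum(abs(x) for x in frame) / n
    if mean_abs == 0.0:
        return 0.0  # silent frame: no meaningful peakiness
    rms = math.sqrt(sum(x * x for x in frame) / n)
    return rms / mean_abs


def pitch_gain(frame, lag):
    """One-tap long-term predictor gain at `lag` (assumed definition).

    Computed as <x[n], x[n-lag]> / <x[n-lag], x[n-lag]> over the part of
    the frame where the delayed sample exists.
    """
    num = sum(frame[i] * frame[i - lag] for i in range(lag, len(frame)))
    den = sum(frame[i - lag] ** 2 for i in range(lag, len(frame)))
    return num / den if den > 0.0 else 0.0
```

Under these assumed definitions, a strongly periodic frame yields a pitch gain near 1 at its true lag, while noise-like frames tend to show many zero crossings and a low pitch gain, and isolated pulses show high peakiness.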
Similar resources
TTS based very low bit rate speech coder
This paper addresses a speech coder which uses a Text-To-Speech (TTS) synthesis system to achieve very low bit rates (below 1 kbps). The main issue of the work is the accurate coding of the pitch (f0) and gain contours, which are principal components of prosody. This is of paramount interest since the correct prosody will increase naturalness and an efficient coding scheme will provide high coding g...
Design of a Variable Rate Algorithm for the CS-ACELP Coder
In 1995, the 8 kb/s CS-ACELP coder G.729 was standardized by ITU-T SG15, and it has been reported that the speech quality of G.729 is better than or equal to that of 32 kb/s ADPCM (G.726). However, G.729 is a fixed-rate speech coder, and it does not take into account voice activity in two-way conversation. If we use the voice activity, we can reduce the average bit rate by half without any ...
Overview of Code Excited Linear Predictive Coder
Advances in speech coding technologies have enabled speech coders to reduce bit rates to a great extent while maintaining roughly the same speech quality. One of the most important driving forces behind this feat is the analysis-by-synthesis paradigm. The Code Excited Linear Predictive (CELP) coder is an efficient closed-loop analysis-by-synthesis method for narrow and medium band...
Enhanced Waveform Interpolative Coding at 4 kbps
This paper presents an Enhanced Waveform Interpolative (EWI) speech coder at 4 kbps. The system incorporates novel features such as analysis-by-synthesis (AbS) vector-quantization (VQ) of the dispersion-phase, AbS optimization of the slowly evolving waveform (SEW), a special pitch search for transitions, and switched-predictive analysis-by-synthesis gain VQ. Subjective quality tests indicate th...
Analysis-by-synthesis multimode harmonic speech coding at 4 kb/s
This paper presents a 4 kb/s Analysis-by-Synthesis Multimode Harmonic Coder (AbS-MHC). Novel features of this coder include a signal modification technique that allows time-domain analysis-by-synthesis parameter estimation in a sinusoidal coding framework, and a frequency-domain transition speech model with improved parameter estimation and quantization schemes. An efficient quantization scheme fo...
Publication date: 1997